Computerized, Copy-Detection and Discrimination Apparatus and Method

ABSTRACT

An engine identifying segments or portions of one source material or source file common to or found in another source material or file. The engine may receive a first data stream in binary form as well as a second stream in binary form. The engine may include a data stream processor or pre-processor programed to translate the first and second data streams to generate respective first and second processed data streams. The commonality between the first and second processed data streams may be greater than the commonality between the first and second data streams themselves. Also, a comparator may be programmed to compare the first and second process data streams and identify binary segments found in both the first and second processed data streams.

BACKGROUND

1. The Field of the Invention

This invention relates to computerized detection and enforcement toolsfor protection of intellectual property rights and, more particularly,to novel systems and methods for detecting unauthorized copying ofintellectual property.

2. The Background Art

In certain situations, it is a relatively simple endeavor to determinewhen another is copying or otherwise infringing on one's intellectualproperty rights. For example, if one were an owner of a patent, onemight find in the market place and purchase the product of a potentialinfringer. That product may then be compared at the convenience of thepatent owner against the patent owner's claims. Similarly, if one has acopyrighted work one may often purchase the works or publications ofanother and compare them to determine whether copyright infringement hasoccurred.

However, certain materials may not lend themselves to a simpledetermination of patent infringement, copyright infringement, or otherintellectual property violation. For example, a particular softwarepackage completed and provided in an executable form will typically notidentify the source code from which the executable was derived. Suchsource code is typically only obtained after litigation has beeninitiated and the source material or source code has been produced indiscovery. Discovery proceedings often occur later as opposed to earlierin a lawsuit, and virtually never before. Accordingly, one may notdiscover whether or not a new issue of copyright infringement ofintellectual property has occurred until much effort (read: money) hasbeen expended in the suit.

Similarly, for other digital media such as sound recordings, differentsampling techniques and modifications may mask the fact that onerecording originates in another recording or other work protected bycopyright registrations or the like. Also, in certain applications, thesheer volume of electronic applications being transferred does not lenditself to ready examination or comparison to identify the details of thecontent of that traffic. A prodigious effort would be required todetermine whether that content represents a misappropriation or otherviolation of some intellectual property right. As a result, significantviolations of intellectual property rights may pass undetected.

Accordingly, what is needed is a system providing rapid analysis ofdigital traffic or data streams to provide information regardingmathematically recognizable patterns in the content thereof. Inparticular, one needs automated review for assessing violations of theintellectual property rights of others. Additionally, what is needed isa method of reducing or translating digital content to a commondominator or simplified structural form so that it may be readilycompared to other similarly reduced, transformed, or translatedmaterial. Such a system may enable and improve online detection andenforcement of intellectual property rights in the digital area wheremassive volumes of data pass through servers, undetectable except in themost blatant and obvious cases.

BRIEF SUMMARY OF THE INVENTION

In view of the foregoing, in accordance with the invention as embodiedand broadly described herein, a method and apparatus are disclosed inone embodiment of the present invention as including an engine receivingsource material or source files, processing that source material, andproducing a useful output therefrom. In general, source material may beany binary data stream. An engine may be configured to identify segmentsor portions of one source common to, or also found in, another source.Such commonality may occur when two or more sources share a commonorigin. Accordingly, an engine may assist in protecting and enforcingintellectual property rights in source material

In selected embodiments, an engine may include one or morepre-processors, comparators, post-processors, user interfaces, anddatabases. A pre-processor may be configured to reduce source materialto its “common denominator.” That is, source material may be of asimilar type, yet be dissimilar in form. Accordingly, a pre-processormay work to translate both multiple materials or files to a common formto facilitate direct comparison one against the other.

A comparator may be configured to identify segments common to more thanone source. For example, a comparator may identify all segmentscomprising at least a certain length common to both a first source and asecond source. A post-processor may be configured to convert theinformation obtained by the comparator into the output desired by theuser. For example, a post processor may generate a report listing thesegments identified by the comparator.

A user interface may be configured to support interaction between a userand the other components of an engine. For example, through a userinterface, a user may dictate the minimum length for segments identifiedby the comparator. In selected embodiment, certain information may bepassed directly from one component to another. Alternatively, a databasemay be configured to store and recall information used by the componentsof an engine. For example, a database may store one or more processeddata streams generated by a pre-processor.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing features of the present invention will become more fullyapparent from the following description and appended claims, taken inconjunction with the accompanying drawings wherein like structures areidentified by like numerals throughout. Understanding that thesedrawings depict only typical embodiments of the invention and are,therefore, not to be considered limiting of its scope, the inventionwill be described with additional specificity and detail through use ofthe accompanying drawings in which:

FIG. 1 is a schematic block diagram of one embodiment of a computersystem implementing an apparatus and method in accordance with thepresent invention;

FIG. 2 is a schematic block diagram of an intellectual propertyenforcement engine or “discrimination engine” in accordance with thepresent invention;

FIG. 3 is a schematic block diagram of one embodiment of a pre-processorof an engine in accordance with the present invention;

FIG. 4 is a schematic block diagram of one embodiment of a comparator ofan engine in accordance with the present invention;

FIG. 5 is a schematic block diagram of one embodiment of apost-processor of an engine in accordance with the present invention;

FIG. 6 is a schematic block diagram of a method for computerizedcopyright protection in accordance with the present invention;

FIG. 7 is a schematic block diagram of an alternative method forcomputerized protection of intellectual property rights in accordancewith the present invention; and

FIG. 8 is a schematic block diagram illustrating some of the locationswhere one embodiment of an engine, in accordance with the presentinvention, may be applied to a computer network.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

It will be readily understood that the components of the presentinvention, as generally described and illustrated in the drawingsherein, could be arranged and designed in a wide variety of differentconfigurations. Thus, the following more detailed description ofselected embodiments of a system and method implemented in accordancewith the present invention, as represented in the drawings, is notintended to limit the scope of the invention, as claimed, but is merelyrepresentative of various embodiments of the invention. The illustratedembodiments of the invention will be best understood by reference to thedrawings, wherein like parts are designated by like numerals throughout.

Referring to FIG. 1, an apparatus 10 or system 10 for implementing thepresent invention may include one or more nodes 12 (e.g., clients 12,computers 12). Such nodes 12 may contain one or more processors 14 orCPU's 14. A CPU 14 may be operably connected to a memory device 16. Amemory device 16 may include one or more devices such as a hard drive 18or other non-volatile storage device 18, a read-only memory 20 (ROM 20),and a random access (and usually volatile) memory 22 (RAM 22 oroperational memory 22). Such components 14, 16, 18, 20, 22 may exist ina single node 12 or may exist in multiple nodes 12 remote from oneanother.

In selected embodiments, the apparatus 10 may include an input device 24for receiving inputs from a user or from another device. Input devices24 may include one or more physical embodiments. For example, a keyboard26 may be used for interaction with the user, as may a mouse 28 orstylus pad 30. A touch screen 32, a telephone 34, or simply atelecommunications line 34, may be used for communication with otherdevices, with a user, or the like. Similarly, a scanner 36 may be usedto receive graphical inputs, which may or may not be translated to otherformats. A hard drive 38 or other memory device 38 may be used as aninput device whether resident within the particular node 12 or someother node 12 connected by a network 40. In selected embodiments, anetwork card 42 (interface card) or port 44 may be provided within anode 12 to facilitate communication through such a network 40.

In certain embodiments, an output device 46 may be provided within anode 12, or accessible within the apparatus 10. Output devices 46 mayinclude one or more physical hardware units. For example, in general, aport 44 may be used to accept inputs into and send outputs from the node12. Nevertheless, a monitor 48 may provide outputs to a user forfeedback during a process, or for assisting two-way communicationbetween a processor 14 and a user. A printer 50, a hard drive 52, orother device may be used for outputting information as output devices46.

Internally, a bus 54, or plurality of buses 54, may operablyinterconnect a processor 14, memory devices 16, input devices 24, outputdevices 46, network card 42, and port 44. The bus 54 may be thought ofas a data carrier. As such, the bus 54 may be embodied in numerousconfigurations. Wire, fiber optic line, wireless electromagneticcommunications by visible light, infrared, and radio frequencies maylikewise be implemented as appropriate for the bus 54 and the network40.

In general, a network 40 to which a node 12 connects may, in turn, beconnected through a router 56 to another network 58. In general, nodes12 may be on the same network 40, adjoining networks (i.e., network 40and neighboring network 58), or may be separated by multiple routers 56and multiple networks as individual nodes 12 on an internetwork. Theindividual nodes 12 may have various communication capabilities. Incertain embodiments, a minimum of logical capability may be available inany node 12. For example, each node 12 may contain a processor 14 withmore or less of the other componentry described hereinabove.

A network 40 may include one or more servers 60. Servers 60 may be usedto manage, store, communicate, transfer, access, update, and the like,any practical number of files, databases, or the like for other nodes 12on a network 40. Typically, a server 60 may be accessed by all nodes 12on a network 40. Nevertheless, other special functions, includingcommunications, applications, directory services, and the like, may beimplemented by an individual server 60 or multiple servers 60.

In general, a node 12 may need to communicate over a network 40 with aserver 60, a router 56, or other nodes 12. Similarly, a node 12 may needto communicate over another neighboring network 58 in an internetworkconnection with some remote node 12. Likewise, individual components mayneed to communicate data with one another. A communication link mayexist, in general, between any pair of devices.

Referring to FIG. 2, an apparatus 10 may support a discrimination engine62, simply referred to as an engine 62, in accordance with the presentinvention. An engine 62 may receive source material 64 and process thatsource material 64 to produce a useful output 66 therefrom, for example,discriminating patterns otherwise not readily detectable. In general,source material 64 may be any binary data stream. For example, sourcematerial 64 may be image data 64 a, music 64 b, text 64 c, video 64 d,executable software 64 e, or some other 64 f data represented as abinary data stream.

An engine 62 in accordance with the present invention may be configuredto identify patterns of the actual content of segments or portions ofone source 64 common to, or also found in, another source 64. Suchcommonality may occur when two or more sources 64 share a common origin.Accordingly, in selected embodiments, an engine 62 in accordance withthe present invention may assist in protecting and enforcingintellectual property rights in source material 64.

For example, an engine 62 may determine when one executable softwareprogram 64 e or file 64 e derived or copied from some portion orportions of another executable software program 64 e or file 64 e. Oncesuch a link is established, appropriate action may be taken to protectthe underlying executable software 64 e and end any improper use thereofby an unauthorized party.

In selected embodiments, an engine 62 may include one or morepre-processors 68, comparators 70, post-processors 72, user interfaces74, and databases 76. While the functionality between the variouscomponents 68, 70, 72, 74, 76 of an engine 62 may vary, each component68, 70, 72, 74, 76 may be any arrangement or combination of hardware,software, or hardware and software configured to provide that desiredfunctionality

In general, a pre-processor 68 may be configured to reduce sourcematerial 64 to its “common denominator.” That is, source material 64 maybe of a similar type, yet be dissimilar in actual form. For example, afirst music file 64 a may be a digital recording of a particularperformance taken at a particular sampling frequency. A second musicfile 64 a may originate with that same digital recording of the sameperformance, but be compressed or sampled at a different frequency.Accordingly, a pre-processor 68 may operate to translate both the firstand second files 64 a to a common form (e.g., compression, frequency,and the like) to facilitate direct comparison one against the other.

A comparator 70 may be configured to detect and identify segments commonto more than one source 64. For example, a comparator 70 may identifyall common segments comprising at least a certain selected length (e.g.,number of bytes) found to be common to both a first source 64 e and asecond source 64 e. Thus, no need for source code or human comparisonmay be required.

A post-processor 72 may be configured to convert the informationobtained by the comparator 70 into the output 66 desired by the user.For example, a post processor 72 may generate a report listing thecommon segments identified by the comparator 70. As a particular matter,a post-processor 72 may facilitate more aggressive assertion andprotection of intellectual property rights. In general, the capabilityof the post-processor 72 may vary according to the sophistication orcomplexity of the output 66 desired by the user.

A user interface 74 may be configured to support interaction between auser and the other components 68, 70, 72, 76 of an engine 62. Forexample, through a user interface 74, a user may dictate a minimumlength for segments to be tested and identified by the comparator 70.Larger segments may be easier to detect or discriminate rapidly. Shortersegments may provide a more thorough survey. Through a user interface74, a user 74 may dictate the format, presentation, visualization, orother type of output 66 generated by the post-processor 72. Similarly,through a user interface 74, a pre-processor 68 may inform a user of thevarious “translation” or transformation procedures applied to “reduce”the source material 64 to a more useful, comparable, or other desiredform.

In selected embodiments, certain information may be passed directly fromone component to another. For example, upon completion of its work, apre-processor 68 may pass a processed data stream 78 to a comparator 70.The comparator 70 may then hold the processed data stream 78 until itreceives another processed data stream 78 suitable for conducting acomparison.

Alternatively, a database 76 may be configured to store and recallinformation 78, 80 used by the components 68, 70, 72, 74 of an engine62. For example, a database 76 may store one or more processed datastreams 78 generated by a preprocessor 68. Accordingly, comparator 70may compare the one or more previously processed sources 78 a, 78 b, 78n to identify common segments.

In still other embodiments, certain information may be passed fromcomponent to component (e.g., from pre-processor 68 to comparator 70),while other information is stored in a database 76. For example, uponcompletion of its processing, a pre-processor 68 may pass its output asa processed data stream 78 to a database 68 for storage and recall. Thepre-processor 68 may then output data suitable to point or otherwiseinform a comparator 70 of the location of the new processed data stream78 in a memory location, file, register, buffer, or the like.Accordingly, the comparator 70 may compare the new processed data stream78 with others 78 previously identified and stored in some memorylocation, file, or the like inside or indicated by the database 76.

In general, any information used by the engine 62 may be stored in oridentified by data in a database 76. In addition to the processed datastreams 78, other information 80 stored in a database 76 may includecommon segments identified by a comparator 70, reports generated by anpost-processor 72, default settings received from a user through a userinterface 74, and the like.

Referring to FIG. 3, a pre-processor 68 may be configured in anysuitable manner to reduce or translate source material 64 to its “commondenominator.” The configuration may largely depend on the nature of thesource material 64 being so transformed, reduced, or translated. Theconfiguration may also depend on whether the engine 62 processes sources64 of a single form or diverse sources 64 having disparate forms.Engines 62 configured to analyze diverse source materials 64 may beconfigured with the recognition tools and methods suited to an array ofsource materials 64.

In selected embodiments, a pre-processor 68 may include a targetselection function 82 and a translation function 84. A target selectionfunction 82 in accordance with the present invention may implement thedesires of a user (as communicated in inputs received from a userinterface 74), default settings, or the like in determining what portionof a particular source 64 is passed on for analysis and furtherprocessing. Some factors that may be considered in selecting ordesigning a target selection method may include the size (e.g., numberof bytes) of a source file 64, processing and memory capacity of theengine 62, processing and memory capacity of various components of theengine 62 (e.g., the comparator 70), nature or fundamentalcharacteristics of the source material 64, or the like.

For example, for certain sources 64, a pass-through option 86 may beselected or applied. In such an option 86, the target selected foranalysis and further processing may be the entire source 64 in the sameform in which it was received by the pre-processor 68. Such an option 86may be described as a static selection method and may be well suited fornon-executable source files 64 of relatively small size.

Alternatively, a sampling process 88 may be applied in which certainportions of the source 64 are selected for analysis and furtherprocessing. These portions, however, may be in the same form in whichreceived by the pre-processor 68. For example, for executable software,a sampling method 88 may select a sample based on a load mapcorresponding thereto. In one embodiment, for example, a targetselection function 82 may use a load map to select a sample avoiding alllibraries or similar functions not likely to be of interest.

In other applications, a target selection function 82 may be programmed,set, or otherwise instructed to apply an execution or interpretationanalysis 90 to a particular source file 64 or source material 64. Suchan analysis may be considered a dynamic selection method 90. Forexample, executable code 64 e may be executed and the resultingstep-by-step instructions may be recorded. That is, the sample made foranalysis and further processing may be generated by executing a datastream 64 and recording in substantially real time the processinginstructions produced thereby as that sample. Such an approach maygreatly expand the amount of material available for analysis and providegreater insight into the origins and functionality of source material 64so processed.

In still other applications, other options 92 or methods 92 may beapplied to select an appropriate sample for analysis and furtherprocessing. For example, a hybrid approach may be applied, combiningaspects of sampling and dynamic methods 88, 90. For example, a sourcefile 64 may be sampled according to the sampling method 88. Thereafter,those samples may be dynamically processed and recorded according to thedynamic method 90.

A translation function 84, or transformation 84 in accordance with thepresent invention may implement the desires of a user, default settings,or the like in determining what reduction or translation processes toapply. In general, a reduction or translation process in accordance withthe present invention may be any process that reduces, transforms, ortranslates the sample selected by the target selection function 82 to aform in which it may be meaningfully compared to other samples ofsimilar form and having characteristics of interest. In other words, areduction or translation process may increase the commonality betweentwo processed data streams above the commonality otherwise between thesource materials 64 corresponding to the two processed data streams.This may be done while preserving the integrity (e.g., underlyingmeaning, characteristics, effect, etc.) of the source material 64.

For example, for certain sources 64, a pass-through option 94 may beselected or applied. In such an option 94, the source material 64 may beleft substantially unaltered. Such an option 94 may be appropriate whena source file 64 is already in a desired form.

In selected embodiments, a translation function 84 may apply a registertransfer language transform 96 or the like. A register transfer languagemay be used to describe operations (upon registers) caused by theexecution of each instruction. When applied, a register transferlanguage transform 96 may tend to remove variations caused by differentcompilation schemes.

That is, two compilers may compile the same source code according todifferent schemes, resulting in two different executable softwareprograms. This may mask the fact that both programs originated with thesame source code. Application of a register transfer language transform96 in accordance with the present invention may convert the executablesoftware 64 e into a common form independent of compilation scheme.

In certain embodiments, a translation function 84 may apply ade-optimizer 98. When applied, a de-optimizer 98 may undo anyoptimization embodied in a source file 64. This may be done withoutregard to whether the optimization originated with a compute-basedoptimizer or in the choices of a programmer. For example, code may bewritten by a programmer to contain an instruction set of “A+B=C,A+B+C=D, C+D=E.” Such a set may include some optimization. That is,C+D=E would be an optimization over A+B+D=E. However, optimizationsoftware may further optimize the set by changing “A+B+C=D” to “C+C=D.”

Without regard to where the various optimizations originated, ade-optimizer 98 in accordance with the present invention may reduce ortranslate the instruction set to provide “A+B=C, A+B+C=D, A+B+D=E.”Accordingly, differences in source material 64 caused by optimizationmay be removed, leaving only the underlying, fundamental operations.Like factoring a polynomial, de-optimization reduces complexity to abasic, simple set of elements that compare more readily.

In still other applications, other options 100 or methods 100 may beapplied to reduce the sample selected by a target selection function 82to a form in which it may be meaningfully compared to other samples ofsimilar form. For example, a hybrid approach may be applied, combiningaspects of a register transfer language transform 96 and a de-optimizer98. Alternatively, other methods 100 more suited for the particularsource material 64 may be implemented. For example, a translationfunction 84 may apply a frequency transform, a compression transform, orthe like.

After passing through a pre-processor 68 in accordance with the presentinvention, a particular source material 64 or file 64 may becharacterized as a processed data stream 78 or a processed source file78. Such processed streams 78, files, etc. may be passed to a database76 for storage, passed to a comparator 70 for further analysis, orotherwise used or stored.

Referring to FIG. 4, a comparator 70 may be configured in any suitablemanner to identify segments common to more than one source 64. Inselected embodiments, a comparator 70 may include a comparison function102 and a segment compilation function 104. A comparison function 102 inaccordance with the present invention may implement the desires of auser, default settings, or the like in identifying segments common tomore than one source 64. For example, a comparison function 102 maypresent a user with the option of conducting a byte-for-byte comparison106 between two processed source materials 64. Alternatively, acomparison function 102 may support a “sliding window” comparison 108,or some other suitable comparison method 110.

In a sliding window comparison 108, a minimum segment size is determinedby preprogramming, selection by a user, or the like. For example, aminimum segment size may be set at some number of bytes. The comparisonfunction 102 may then apply a “window” covering (e.g., considering) thatnumber of bytes to each of the processed source materials 78 beingcompared. The windows may then proceed to move forward, byte-by-byte,along the source materials 78.

When the contents of one window matches the contents of another window,the size of the window of comparison may be increased until the windowsno longer match. Thus, the full extent of the commonality of thesegments may be determined. Thereafter, the windows may be reduced tothe original size and continue the byte-by-byte march to locate the nextsegment in common. In such a manner, all segments above a specifiedlength that are common to the processed source materials 78 may beidentified.

In selected embodiments, a segment compilation function 104 may collectall the common segments 112 (i.e., segments 112 a, 112 b, 112 c found ineach of a particular plurality of source materials 78). The commonsegments 112 may then be passed on to a post-processor 72, reported tothe user through the user interface 74, stored in the database 76, orthe like, or some combination thereof.

Referring to FIG. 5, a post-processor 72 may be configured in anysuitable manner to convert the information obtained by the comparator 70into the output 66 desired by the user. In selected embodiments, variousfunctions or modules may be included within a post-processor 72 toprovide or generate the desired outputs 66.

For example, in one embodiment, a post-processor 72 may include a soundrecording function 114. A sound recording function 114 may receive asegment 112 and generate a sound recording 66 a. A segment 112 may be asegment identified by a comparator as being common or found in more thanone source 78. In selected embodiments, a sound recording function 114may create a sound recording 66 a from a segment 112 by adding a header118 and a trailer 120 identifying this segment 112 as a sound recording.For example, the header 118 and trailer 120 may instruct a readingcomputer or other electronic instrument to play back or otherwise treatthe segment 112 as it would a sound recording.

A post-processor 72 may include a copyright application function 122. Acopyright application function may receive inputs from a database oruser interface and apply those inputs to a standard copyrightapplication form corresponding to one or more countries. Accordingly, acopyright application function 122 may generate a copyright application66 b suitable for filing, or perhaps execution and filing, with thesuitable authorized body for a desired country.

In other embodiments, a post-processor 72 may include an infringementreport function 124. An infringement report function 124 may facilitategeneration of, or generate, an infringement report 66 c. An infringementreport 66 c may contain information listing or identifying the varioussegments 112 recordings 66 a or sound recordings 66 a that are beingcopied from one source material 78 to another.

In certain embodiments, a post-processor 72 may include acease-and-desist function 126. A cease-and-desist function 126 mayfacilitate generation of a cease-and-desist letter 66 d or othercommunication 66 d or notice 66 d. Such a letter 66 d may provide awarning or otherwise inform a potential infringer of the infringement orcopying of segments 112 identified by the comparator 70.

A post-processor 72 may also include a database maintenance function128. A maintenance function 128 may pass the outputs 66 generated bypost-processor 72 to a database 76 for storage. For example, a databasemaintenance function 128 may pass sound recordings 66 a to a database 76for storage and recall. Likewise, a database maintenance function 128may store copyright applications 66 b, infringement reports 66 c,cease-and-desist letters 66 d, or the like in a database 76 for futurereference or use. A post-processor 72 in accordance with the presentinvention may include other functional features or modules 140 thatprovide other useful outputs 66 e.

Referring to FIG. 6, an engine 62 may implement a method 132 inaccordance with the present invention. Such a method 132 may begin withobtaining 134 some piece of original source material 64. The sourcematerial 64 may be translated 136 to reduce the source material 64 to aform suitable for comparisons to source material 64 of other sources.

The method 132 may continue with obtaining 138 resource material 64 toevaluate. That new source material 64 may then also be translated 140.Once the original source material 64 and new source material 64 aretranslated they may be compared 142. The comparison process mayfacilitate identification 144 of common segments 112. The commonsegments 112 may be any part or portion of the original source material64 also found in the new source material 64. That is, for example thecommon segments 112 are common to both the original source material 64and the new source material 64.

Once the common segments 112 are identified 144, they may be converted146 into sound recordings 66 a. As sound recordings 66 a, the commonsegments 112 may be the subject of copyright protection. Accordingly, anengine 62 operating on behalf of an operator or user, may pursue 148copyright or other protection of the sound recordings 66 a. Oncecopyright or other protection of the sound recording 66 a is obtained,the protection may be enforced against the creator or user of new sourcematerial 64.

For example, the owner or licensee of the original source material 64may have an action for copyright infringement against the owner orlicensee of the new source material 64. Accordingly, the owner orlicensee of the original source material 64 may seek 150 the cooperationof the owner of the new source material 64.

The cooperation of the owner of the new source material may be sought150 in a number of ways. For example, cooperation may be sought 150through a cease-and-desist letter 152, through an offer to take alicense 154, through a request to pay damages 156, through litigation158 in federal or state courts, or the like, or through some other 160mechanism, tribunal, or the like.

Referring to FIGS. 7 and 8, an engine 62 in accordance with the presentinvention may be used to monitor traffic passing through a server 12,network 40, or the like. For example, in one method 162 in accordancewith the present invention, copyright monitoring software may beinstalled 164 on a computer attached to or otherwise monitoring a server12 or a network 40. The software may translate 166 a suite ofcopyrightable works. For example, the copyright software may reduce asuite or collection of source material 64 to a simplified or processedstate 78. Accordingly, one need only translate the new source material64, or source material 64 corresponding to someone other than the ownerof the original source material 64, and then compare that source 64 withthe now translated 166 suite of copyrightable works.

This may be done by monitoring 168 electronic traffic to locate matchingsegments 112. For example, one may procure the assistance of owners ormanagers of mail services 170. Accordingly, the engine 62 may monitormail traffic. In monitoring 128 such traffic, the engine 62 may identifyall or a sampling of content to detect common segments 112.

Alternatively, an engine 62 in accordance with the present invention maysearch 172 electronic resources to locate matching segments 112. Forexample, this may be done by crawling the internet 174 to locate commonsegments 112. Common segments 112 may be found in websites, inmusic-sharing locations, software-downloading locations, or the like.Once common segments 112 are identified, a method 162 may continue withthe identification 176 of the potentially infringing works and theirowners, users, sellers, buyers, and so forth. For example, the softwareor engine 62 may seek information corresponding to the common segments112 that would identify the owner thereof. Once the owners of thepotentially infringing works have been identified 176, the method 162may continue with the reporting 178 of potential infringers forenforcement action.

The present invention may be embodied in other specific forms withoutdeparting from its basic features or essential characteristics. Thedescribed embodiments are to be considered in all respects only asillustrative, and not restrictive. The scope of the invention is,therefore, indicated by the appended claims, rather than by theforegoing description. All changes that come within the meaning andrange of equivalency of the claims are to be embraced within theirscope.

1. a system comprising: a first data stream in binary form; a seconddata stream in binary form; a data stream processor programmed totranslate the first and second data streams to generate respective firstand second processed data streams, the commonality between the first andsecond processed data streams being greater than the commonality betweenthe first and second data streams; and a comparator programmed tocompare the first and second processed data streams and identify binarysegments common to both the first and second processed data streams. 2.The system of claim 1, wherein the first and second data streams bothcorrespond to one of images, music, text, video, and executables.
 3. Thesystem of claim 2, wherein: the first and second data streams haverespective sampling frequencies different from one another; the firstprocessed data stream comprises at least a portion of the first datastream; and the second processed data stream comprises at least aportion of the second data stream translated to a sampling frequencycommon to that of the first processed data stream.
 4. The system ofclaim 2, wherein: the data stream processor is further programmed toexecute at least one method to identify first and second samplesselected from the first and second data streams, respectively; and thedata stream processor is further programmed to translate the first andsecond samples to generate the first and second processed data streams.5. The system of claim 4, further comprising a user interface programmedto communicate instructions from a user to the data stream processor toselect which method of the at least one method is executed by the datastream processor.
 6. The system of claim 5, where a first method of theat least one method comprises selecting the entire first data stream asthe first sample and the entire second data stream as the second sample.7. The system of claim 6, wherein first and second data streams compriseexecutable code.
 8. The system of claim 7, where a second method of theat least one method comprises: executing the first data stream andrecording in substantially real time the processing instructionsexecuted thereby as the first sample; and executing the second datastream and recording in substantially real time the processinginstructions executed thereby as the second sample.
 9. The system ofclaim 8, wherein: the first processed data stream comprises the firstdata stream translated into register transfer language; and the secondprocessed data stream comprises the second data stream translated intoregister transfer language.
 10. The system of claim 9, wherein: thefirst processed data stream comprises a de-optimized version of thefirst data stream; and the second processed data stream comprises ade-optimized version of the second data stream.
 11. The system of claim10, further comprising a post-identification data stream processorprogrammed to convert the binary segments into digital sound recordingfiles.
 12. The system of claim 1, wherein: the first and second datastreams have respective sampling frequencies different from one another;the first processed data stream comprises at least a portion of thefirst data stream; and the second processed data stream comprises atleast a portion of the second data stream translated to a samplingfrequency common to that of the first processed data stream.
 13. Thesystem of claim 1, wherein: the first processed data stream comprises atleast a portion of a substantially real time recording of the processinginstructions executed by executing the first data stream; and the secondprocessed data stream comprises at least a portion of a substantiallyreal time recording of the processing instructions executed by executingthe second data stream.
 14. The system of claim 1, wherein: the firstprocessed data stream comprises the first data stream translated intoregister transfer language; and the second processed data streamcomprises the second data stream translated into register transferlanguage.
 15. The system of claim 1, wherein: the first processed datastream comprises a de-optimized version of the first data stream; andthe second processed data stream comprises a de-optimized version of thesecond data stream.
 16. The system of claim 1, further comprising apost-identification data stream processor programmed to convert thebinary segments into digital sound recording files.
 17. The system ofclaim 1, further comprising a user interface programmed to communicateinstructions from a user to the comparator to specify a minimum lengthof the binary segments identified
 18. A computer-implemented systemcomprising: a first data stream represented in a binary form; a seconddata stream represented in a binary form; a data stream processorprogrammed to identify a first sample corresponding to the first datastream and a second sample corresponding to the second data stream; thedata stream processor further programmed to translate the first andsecond samples to generate respective first and second processed datastreams corresponding thereto, wherein the commonality shared betweenthe first and second processed data streams being greater than thecommonality shared between the first and second data samples; acomparator programmed to compare the first and second processed datastreams and identify binary segments common to both the first and secondprocessed data streams; and a post-identification data stream processorprogrammed to convert the binary segments into digital sound recordingfiles.
 19. The system of claim 18, further comprising a user interfaceprogrammed to communicate instructions from a user to the comparator tospecify a minimum length of the binary segments identified.
 20. A systemcomprising: a first data stream in binary form; a second data stream inbinary form; a data stream processor programmed to identify a firstsample corresponding to the first data stream and a second samplecorresponding to the second data stream; the data stream processorfurther programmed to translate the first and second samples to generaterespective first and second processed data streams, the first and secondprocessed data streams sharing a commonality greater than that shared bythe first and second data samples; a comparator programmed to comparethe first and second processed data streams and identify binary segmentstherein having at least a minimum length and common to both the firstand second processed data streams; a user interface programmed tocommunicate instructions from a user to the comparator to specify theminimum length of the binary segments identified; a post-identificationdata stream processor programmed to convert the binary segments intodigital sound recording files; and the post-identification data streamprocessor further programmed to prepare at least one copyrightapplication petitioning copyright protection for at least one of thedigital sound recording files.